BMC Genomic Data
○ Springer Science and Business Media LLC
Preprints posted in the last 90 days, ranked by how well they match BMC Genomic Data's content profile, based on 12 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Shen, J.; Tang, S.; Xia, Y.; Qin, J.; Xu, H.; Tan, Z.
Show abstract
BackgroundConventional models of human ribosomal DNA (rDNA) array organization have historically depended on transcription-centric boundaries, partitioning the unit into a [~]13 kb rDNA transcription region and a monolithic [~]31 kb intergenic spacer (IGS). While our previous identification of Duplication Segment Units (DSUs) mapped these arrays based on an intuitive analysis of the microsatellite density landscape of the complete reference human genome, our present deep mining of this landscape has revealed a more accurate rDNA Gene Unit Pattern. Methods & ResultsIn this study, we conducted a deep mining analysis of our previously established microsatellite density landscape of the T2T-CHM13 assembly, focusing specifically on nucleolar organizing regions (NORs). We suggest a more accurate rDNA Gene Unit Pattern containing a (CTTT)n microsatellite aggregation ahead of the rDNA gene and a (CT)n microsatellite aggregation behind the gene, rather than a pattern featuring an IGS region inserted between two rDNA genes. ConclusionsA correct rDNA gene pattern of the human genome probably includes a (CTTT)n microsatellite aggregation ahead of the gene and a (CT)n microsatellite aggregation behind it, which possibly constitute cis- and trans-regulating regions; the (CTTT)n and (CT)n microsatellite aggregations may provide two different local stable DNA structures for regulatory protein binding.
Axelsson, J.; Bruhn-Olszewska, B.; Sarkysian, D.; Markljung, E.; Horbacz, M.; Pla, I.; Sanchez, A.; Nenonen, H.; Elenkov, A.; Dumanski, J. P.; Giwercman, A.
Show abstract
Cancer-related genomic instability (GI) may cause genetic alterations in spermatozoa, implying health issues not only in cancer survivors, but also in their children [1, 2]. We therefore studied Loss of Y chromosome (LOY), considered as hallmark of GI [3-15], in spermatozoa and blood from survivors of childhood and testicular cancer (CC, TC), and controls (CTRL). We found that LOY was statistically significantly more frequent in spermatozoa from cancer survivors than in controls (Odds Ratio [OR]=2.2 for CC vs. CTRL and OR=2.4 for TC vs. CTRL). Furthermore, LOY was about an order of magnitude more prevalent in spermatozoa than in blood among 18-53-year-old males within all cohorts. Our findings suggest that LOY in spermatozoa might be a clinically useful marker of GI, reduced fertility and disease predisposition in males. Introducing LOY in spermatozoa as a biomarker opens a new research avenue into disease prevention and the causes and consequences of LOY.
Sattler, M. C.; Singh, A.; Bass, H. W.; Mondin, M.
Show abstract
BackgroundMaize knobs are regions of constitutive heterochromatin that are readily identified in both meiotic and somatic chromosomes. These structures have been characterized as stable throughout the cell cycle, exhibiting late replication during the S-phase, and are composed of two specific families of highly repetitive DNA sequences: K180 and TR-1. Although widely used as cytogenetic markers due to their variability in number and chromosomal position across inbred lines, hybrids, and landraces, little is known about their chromatin structure and dynamics. In this study, we analyzed chromatin accessibility of knobs using DNS-seq data across four maize tissues representing distinct developmental stages. ResultsOur results reveal that K180 knobs exhibit tissue-specific variation in chromatin accessibility, transitioning between open and closed states during development. In contrast, the TR-1 knob of chromosome 4 remained consistently inaccessible across all tissues analyzed. A knob composed of both K180, and TR-1 further supported this observation, with only the K180 region showing dynamic accessibility. To validate these findings, we also analyzed other repetitive regions such as centromeres, which showed a uniformly closed chromatin structure similar to TR-1. These results suggest a unique developmental modulation of chromatin accessibility associated with K180 repeats. While the chromatin accessibility of knobs does not reach the levels observed at Transcription Start Sites (TSS), the comparison among different classes of repetitive DNA within maize constitutive heterochromatin provides compelling evidence for sequence-specific and tissue-specific chromatin dynamics. ConclusionsOur findings uncover a previously unrecognized property of maize knobs and establish a reference for future studies on chromatin organization and epigenetic regulation of repetitive DNA in plant genomes.
Messa, P. E.; Warren, C. L.; Nicol, N. R.; Pearson, K. S.; Peters, J. P.; Fowler, A. M.; Alarid, E. T.; Ozers, M. S.
Show abstract
Grainyhead-like 2 (GRHL2) is an epithelial transcription factor with context-dependent regulatory roles, yet the sequence rules governing its DNA recognition remain incompletely defined. In this study, a high-density genomic Specificity and Affinity for Protein (SNAP) DNA-binding array containing 772,732 tiled probes derived from GRHL2 ChIP-seq regions was used to resolve GRHL2 binding specificity at 6 base pair resolution across genomic sequences. From high-affinity probes, de novo motif analysis recovered the canonical 5-AACCGGTT-3 motif. Sequence specificity landscapes revealed a stepwise reduction in binding as mismatches were introduced, with the strongest effects at the C (position 3) and G (position 6) within the motif, greater tolerance at the central CG dinucleotide, and intermediate tolerance at the A/T bases at the motif edges. This analysis also demonstrated the influence of nearby flanking sequences. Extended motif and spacing analyses indicated dimeric binding at paired motifs, with periodic helical spacing consistent with interactions on the same face of the DNA helix. Integration of SNAP array binding with ChIP-seq data distinguished direct, motif-encoded GRHL2 occupancy from indirect, cofactor-mediated recruitment at genomic sites. These results define the sequence specificity of GRHL2 interactions with variations in the DNA consensus motif and flanking sequences within an endogenous genomic context. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=77 SRC="FIGDIR/small/719077v1_ufig1.gif" ALT="Figure 1"> View larger version (21K): org.highwire.dtl.DTLVardef@237363org.highwire.dtl.DTLVardef@16c97d7org.highwire.dtl.DTLVardef@64b251org.highwire.dtl.DTLVardef@f72090_HPS_FORMAT_FIGEXP M_FIG C_FIG
Tokmakov, A. A.
Show abstract
Xenopus is a genus of entirely aquatic frogs found in sub-Saharan Africa. Currently, the complete genomes of two species within the Xenopus genus, Xenopus laevis and Xenopus tropicalis, have been fully sequenced, annotated, and made publicly available. The two species inhabit markedly different environments: X. tropicalis lives in the hot, equatorial regions of Africa, whereas X. laevis resides in the cooler climates of southern Africa. In the present study, mutational profiling, comparative homology modeling, and computational bioinformatics were used to identify the features of adaptive evolution in Xenopus endonuclease G (EndoG) proteins. The multiple characteristics of EndoG isozymes were discovered to vary considerably between the two Xenopus species dwelling in different locations. Most notably, EndoG proteins from the psychrophilic X. laevis exhibit the increased contents of charged and polar residues, elevated pI, higher intramolecular interaction energies, B factors, molecular void volumes, and solvent accessibilities, but the decreased contents of nonpolar and aromatic amino acids, lower hydrophobicity, buried surface area, and molecular packing density compared to those from the thermophilic X. tropicalis. The observed differences strongly suggest that temperature plays a dominant role in EndoG diversification. Evaluation of intramolecular interaction energies appears to be a particularly sensitive and discriminative framework for assessing protein divergence at the structural level. Overall, this study highlights the diversification of homologous proteins in ectothermic vertebrate eukaryotes and provides mechanistic insight into protein adaptation to contrasting environments.
Cacheux, L.; Dutrillaux, B.; Gerbault-Seureau, M.; Nicolas, V.; Ponger, L.; Bed'Hom, B.; Escude, C.
Show abstract
BackgroundAlpha satellites, a superfamily of AT-rich tandem repeats, are the primary DNA component of centromeres in Platyrrhini and Catarrhini. Analyses of the human genome suggest that centromeres behave like biological ridges, with new alpha satellite families expanding at the centromere core, splitting and displacing older ones towards the pericentromeres. The Cercopithecini tribe, which displays an unusual chromosomal evolution involving multiple chromosomal fissions and centromere formations, represents a promising model to enhance our understanding of alpha satellite DNA evolutionary history. We previously applied targeted sequencing to centromere DNA from two distant species drawn from the Cercopithecini terrestrial and arboreal lineages, and characterized six alpha satellite families exhibiting varying mean sequence identities. MethodsCombining classical and molecular cytogenetics, we mapped the chromosomal distribution of these alpha satellite families across 13 Cercopithecini, one Papionini, and one Colobinae species. A nuclear marker-based phylogeny provided an evolutionary framework for interpretation. ResultsOur phylogeny identifies the terrestrial and arboreal lineages, and a newly designated swamp clade. We observed significant interspecies variations in alpha satellite patterns, including differences in presence/absence and distinct chromosomal distribution patterns (centromeric, pericentromeric, or subtelomeric). Families previously described as heterogeneous (83-87% mean sequence identity) exhibit a centromeric position in the swamp lineage, which is characterized by conserved karyotypes. In contrast, these families show a pericentromeric distribution in the terrestrial and arboreal lineages, replaced at the centromere core by more homogeneous families (95-98% mean sequence identity). In the arboreal clade, which is characterized by highly fissioned karyotypes, putative evolutionary new centromeres show a unique co-occurrence of highly homogeneous and heterogeneous families. Conclusion & ImplicationsWe propose a comprehensive evolutionary scenario for alpha satellite DNA in Cercopithecini, where younger families arise at the centromere core, shift toward the pericentromeres as they age, and eventually face extinction. Our study suggests that alpha satellite DNA and chromosomes evolve in an interdependent manner, with satellite diversification and displacement occurring in parallel with chromosome fissions and centromere repositioning. This comparative cytogenomic approach provides both support for the human-based evolutionary model for alpha satellite DNA and novel temporal insights into its diversification dynamics. Beyond evolutionary genomics, our findings highlight the potential of alpha satellite DNA to complement systematic studies in deciphering complex primate evolutionary histories.
Shaw, M. H.; DeMeis, J. D.; Arnold, C. A.; Cox, M. R.; Duong, T. C.; Gaviria, K. A.; McDavid, G. K.; Villegas, J. M.; Weimer, M. L.; Patil, S. S.; Alqudah, S. Y.; Borchert, G. M.
Show abstract
Long G4-rich regions (LG4s) are defined as DNA sequences containing a high density of guanine triplets capable of forming non-B DNA structures called G-quadruplexes (G4s). These regions frequently overlap with enhancers, which are regulatory DNA elements that modulate gene expression by interacting with DNA regions that dictate where transcription is initiated known as promoters. While LG4s have now been well-characterized in the human genome, neither LG4 occurrence, nor the ability of LG4s to function as enhancers, in other species has been described. To address this, we screened the genomes of 16 different species from various taxa to identify LG4s and then determined if they were conserved, and if so, if their regulatory capacity was similarly conserved. Our analyses characterized a number of previously unreported LG4s in the human genome as well as LG4s in 13 additional species. Of note, we identified a highly conserved LG4 enhancer predicted to regulate over 40 genes. This LG4 is embedded in the MAZ (Myc-Associated Zinc finger protein) locus, and we find this LG4 possesses the ability to directly interact with the same target promoter in both human and mouse. In summary, this work describes LG4s in the genomes of both unicellular and multicellular species including vertebrates, invertebrates, plants, and fungi. Furthermore, many of these LG4 sequences are highly conserved as is their regulatory capacity.
Varughese, S.; Huang, M.; Savige, J.
Show abstract
Autosomal dominant polycystic liver disease (ADPLD) commonly results from a pathogenic variant in one of 6 genes (GANAB, ALG8, LRP5, PRKCSH, SEC61B, SEC63). Pathogenic variants in these genes are also associated with kidney cysts, which rarely cause kidney failure, but the genes are included in cystic kidney panels. This study determined the population frequency of predicted pathogenic variants in the ADPLD genes in the general population. Variants for each gene were downloaded from gnomAD and annotated with ANNOVAR. The population frequencies were calculated from the number of people with predicted pathogenic variants in gnomAD v.2.1.1:loss-of-function structural and copy number; null; and rare, computationally-damaging missense changes that affected a conserved residue. Frequencies were also estimated from the number of gnomADv.4.1 variants assessed as Pathogenic or Likely pathogenic in ClinVar. Predicted pathogenic variants affected one in 95 people using our strategy and gnomAD v.2.1.1, and one in 151 with ClinVar assessments of gnomAD v.4.1 variants. LRP5 and ALG8 which are associated with a milder clinical phenotype, were the commonest affected genes with both strategies. Predicted pathogenic variants in ADPLD appear more frequent in admixed American (one in 100), Finnish (one in 107) and African/African American (one in 130) people (p all <0.0001 compared with Europeans (one in 197).Predicted pathogenic variants for ADPLD may be even more common because of additional unidentified causative genes. However not all ADPLD variants result in liver cysts, nor indeed cystic kidneys, because of incomplete penetrance and variable expressivity.
Marshall, A. T.; Kan, E.; Adise, S.; König, M.; McConnell, R.; Martinez, M.; Midya, V.; Arora, M.; Sowell, E. R.
Show abstract
Lead is a toxic metal ubiquitous in our environment. While dramatic reductions in lead sources have paralleled equivalent decreases in lead-poisoning rates, chronic lead exposure remains a critical public health concern. Childhood lead exposure (at its lowest levels) is liked to changes in cognitive development but less is known about lead's effects on children's brain structure, especially as a result of in utero exposure. We measured prenatal and early-postnatal lead exposure in shed deciduous teeth of 448 9- and 10-year-old children (from 20 United States cities) and linked those lead levels to childhood brain structure, cognition/behavior, and neighborhood- and family-level socioeconomic characteristics. Here we show negative associations between tooth-lead levels and the thickness of the brain's cortex, particularly in regions linked to language processing. With increasing tooth-lead levels, children of lower-income (versus higher-income) families showed steeper declines in receptive vocabulary. Caregiver-reported behavioral problems exhibited similar associations. With in utero exposure linked to adverse neurodevelopmental outcomes (well before lead exposure and its risks are evaluated by healthcare professionals), prenatal screening of maternal lead levels/exposure, coupled with recommended strategies to reduce its placental transmission, may help reduce lead's effects on future generations.
Gorbenko, I. V.; Scherbakov, D. Y.; Zverintseva, K. M.; Konstantinov, Y. M.
Show abstract
Short Interrupted Repeats Cassettes (SIRC) are recently discovered eukaryotic DNA elements possessing many traits of satellite DNA and mobile genetic elements, and consisted of short direct repeats interspersed with diverse spacer sequences. The SIRC ensemble of individual species is highly heterogenous and cannot be studied using alignment methods. It was found that number of similar SIRC sequences in a given pair of species is in general correlated with their taxonomic distance, and, at the same time, closely related species can possess very diverged SIRC ensembles, which makes SIRC evolutionary pattern closer to mobile genetic element type. The SIRC sequences make up clusters with comparable sequence patterns, that are likely to demonstrate doublet evolutionary model which strongly supports that the SIRC structure is supported by the evolutionary selection. Several SIRC sequences of Arabidopsis were found to be of ancient origin with traceable evolution history as far as to the moss clade. We carried out unbiased detection of SIRC ensembles in 10 plant genomes and found that, despite very high intraspecies heterogeneity, SIRC sets possess strong interspecies phylogenetic signal. Key messageShort Interrupted Repeats Cassettes are elements of ancient origin, and could potentially be used to trace organism history, and to facilitate syntheny and Hi-C analysis.
Wang, Z.; Raeihle, M.; Braun-Gorman, S.; Leung, I.; Richards, C.; Gabbay, L.; Shamoon-Pour, M.
Show abstract
Since the initial distribution of the SARS-CoV-19 vaccine, its widespread use has been hypothesized to act as a selective pressure that drives the COVID-19 virus to mutate. This study aims to investigate the correlation between global vaccination rates and the mutation rate of the SARS-CoV-2 Beta variant (B.1.351). From January to July 2021, nucleotide diversity increased in tandem with vaccination rates, demonstrating that the virus evolved more rapidly in response to selective pressure from mass vaccination. Statistical analysis revealed statistically significant positive correlations between both vaccination rates and vaccine doses administered with nucleotide diversity. Thus, our findings indicate a positive correlation between rising vaccination rates and nucleotide diversity, suggesting that increased vaccination coverage acted as a selective pressure that accelerated viral evolution of SARS CoV2.
Froukh, T.
Show abstract
Currently, the genetic architecture of Middle Eastern populations is underrepresented in global genomic databases. This gap increases the rate of Variants of Uncertain Significance (VUSs) and clinical misinterpretations of genomic data especially in Middle Eastern populations. Whole exome sequencing was conducted on 90 healthy individuals from Jordan and the data were analysed using Principal Component Analysis (PCA) and multi-computational filtering. PCA revealed a double ancestry (EUR-AFR) admixture rather than a triple admixture (EUR-AFR-AMR). More than 3,500 populations-specific variants (PSVs) were identified, of which 72% were singletons. Additionally, 19 variants were significantly enriched compared to the maximum allele frequencies in public global databases (Fisher's exact test with Benjamini-Hochberg false discovery rate correction, p-value < 0.05). Consequently, the results suggest the reclassification of variants of Uncertain Significance (VUS) which reside in the ECE2 gene to likely benign and the variants of Conflicting Classification of Pathogenicity in the genes IL1RN and THPO to benign based on the significant allele frequency (AF=0.0389, p-value < 0.05). Furthermore, a pathogenic ClinVar variant was identified in a healthy individual, warranting careful interpretation. The findings underscore the importance of identifying PSVs in order to minimize or even prevent clinical misdiagnosis and highlight the unique genetic signature in Jordan. The study serves as a foundational resource for precision medicine in the region.
Curtis, D.
Show abstract
UK Biobank has released whole genome sequence data for 500,000 participants, including allele counts for hundreds of millions of variants and these were considered in the context of the pentanucleotide background on which they occurred. Frequencies of singleton variants were obtained and compared with frequencies of more common variants. Results were highly correlated across chromosomes, reflecting systematic effects. C>T singleton variants were less frequent in the CG context but the opposite was true for more common variants, suggesting that they are relatively well tolerated and not subject to strong negative selection. The frequencies of singleton variant types were strongly influenced by their trinucleotide context and the total counts of variants in their trinucleotide context could be well approximated by combining five mutational signatures obtained from genomes of cancer cells. For some variant types, there were marked asymmetries in counts between plus and minus DNA strands. The patterns of these asymmetries for singleton variants differed between chromosomes, with five being negatively correlated with the rest. These asymmetries did not appear related to strand-specific gene content. It was noted that there were also strand asymmetries for some pentanucleotide sequences in the reference genome and that these were consistent across chromosomes. The sequence TTCGT is seen 673300 times on the plus strand but only 465807 times on the minus strand. These findings must reflect strand-specific mechanisms affecting mutation and selection which are not currently well understood and which could be investigated further. This research has been conducted using the UK Biobank Resource.
Lee, H.; Segae, V. S.; Garcia-Abadillo, J.; de Oliveira Bussiman, F.; Trujano Chavez, M. Z.; Hidalgo, J.; Jarquin, D.
Show abstract
Genotype-by-environment interaction (GEI) has been studied to identify environment-stable/favorable genotypes. The GEI simulation could help refine the inference by incorporating tangible factors such as genomic and environmental information. The Bayesian additive main effect and multiplicative interaction (Bayesian AMMI) model captures the genotype-specific responses across environments, reflecting directional relationships between genotypes and environments. Thus, we propose a Bayesian AMMI-based GEI simulation framework that utilizes high-throughput environmental covariance matrices to generate GEI effects with interpretable directional structure. To demonstrate the proposed approach, two simulated phenotypes were assessed under four levels of GEI variance. In the first simulation (Sim1), GEI effects were sampled from a multivariate normal distribution defined by the GEI matrix. In the second simulation (Sim2), GEI effects were generated by extending Sim1 with the Bayesian AMMI model. In both simulations, increasing GEI variance resulted in lower correlations of phenotypes across environments and stronger genotype-specific sensitivity to environmental variation. Across five cross-validation designs, models accounting for GEI consistently outperformed one that did not, with prediction accuracy generally decreasing as GEI variance increased. Clear distinctions between the two simulated phenotypes were evident from biplot analyses: Sim2 successfully captured environmental relatedness and genotype-specific responses, whereas such structure was absent in Sim1. These results demonstrate that the proposed Bayesian AMMI-based GEI simulation framework enables interpretable visualization of GEI and supports genomic selection strategies under complex environmental conditions.
Abrishamcar, S.; Eick, S. M.; Everson, T.; Suglia, S. F.; Fallin, M. D.; Wright, R. O.; Andra, S. S.; Chovatiya, J.; Jagani, R.; Barr, D. B.; Lussier, A. A.; Dunn, E. C.; MacIsaac, J. L.; Dever, K.; Kobor, M. S.; Hoffman, N.; Koen, N.; Zar, H. J.; Stein, D. J.; Hüls, A.
Show abstract
Background Prenatal exposure to pesticides and psychosocial factors often co-occurs, particularly in low- and middle-income settings, yet their joint effects on epigenetic age acceleration (EAA) in early life remain unknown. We investigated the joint associations of prenatal pesticides metabolites and psychosocial factors on EAA in the first five years of life in the South African Drakenstein Child Health Study. Methods In 643 mothers, we measured 11 urinary pesticide metabolites and seven psychosocial factors during the second trimester of pregnancy. Child DNA methylation was measured in whole blood at ages 1, 3, and 5 years. EAA was estimated using the Horvath, Skin & Blood Horvath (skinHorvath), and Wu epigenetic clocks. Longitudinal associations were estimated using generalized estimating equations, adjusted for confounders. Joint mixture associations were evaluated using weighted quantile sum regression (WQS) and quantile g-computation (QGCOMP). Results The joint prenatal exposure mixture was positively associated with Wu ({beta} per one quintile increase in the mixture [95% CI]: 0.41 years [0.15, 0.80]), skinHorvath (0.11 years [0.06, 0.16]), and Horvath EAA (0.31 years [0.20, 0.46]) over time using WQS. Psychosocial factors, particularly food insecurity, physical interpersonal violence, and stress biomarkers, contributed most to the total mixture effect for all clocks. Pyrethroid metabolites PBA and TDCCA were top pesticide contributors to Wu EAA. Pathway enrichment analyses of clock-specific CpGs revealed distinct biological architectures, with the Wu clock enriched for neurodevelopmental and immune pathways, and metabolic pathways for the Horvath clock. Discussion Joint prenatal exposure to pesticides and psychosocial factors was associated with increased EAA across early childhood, with psychosocial factors contributing the most to the total effect. These findings highlight the importance of assessing chemical and non-chemical stressors jointly and clock-specific biological interpretation in epigenetic aging research.
Petrin, A. L.; Keen, H. L.; Dunlay, L.; Xie, X. J.; Zeng, E.; Butali, A.; Wilcox, A.; Marazita, M. L.; Murray, J. C.; Moreno-Uribe, L.
Show abstract
Introduction: Nonsyndromic cleft lip with or without cleft palate (NSCL/P) is a common congenital malformation with complex etiology involving both genetic and environmental factors. Epigenetic mechanisms may mediate environmental contributions, but separating genetic from environmental effects remains challenging. Methods: We present an epigenome-wide association study with 32 monozygotic and 22 dizygotic twin pairs discordant for NSCL/P on blood and saliva samples. Differential methylation analysis was conducted using linear models to identify CpG sites showing significant methylation differences between affected and unaffected twins followed by functional annotation and pathway enrichment analysis. Results: The top-ranked finding is a differentially methylated region comprising two CpG sites at the CYP26A1 locus, cg12110262 (P = 3.21x10-7) and cg15055355 (P = 1.39x10-3). CYP26A1 is essential for retinoic acid catabolism and craniofacial patterning. The chromatin regulator ANKRD11, which causes KBG syndrome featuring cleft palate was the second best hit. Differentially methylated CpG sites showed significant enrichment in craniofacial enhancers and overlap with multiple GWAS-validated cleft genes including VAX1, PVRL1, SMAD3, and PRDM16. Conclusions: Our findings implicate retinoic acid signaling and chromatin regulation in NSCL/P etiology and demonstrate the value of discordant twin designs for distinguishing environmental from genetic epigenetic contributions to complex malformations.
Obaji, D.; Kim, J.; Olagbegi, Y.; Le, A.-T.; Ercan, S.
Show abstract
Transcription factor IIIC (TFIIIC) is a multi-subunit protein complex that recruits RNA polymerase III (Pol III) to the majority of its target genes. Evolutionarily conserved overlap between TFIIIC binding sites and the structural maintenance of chromosomes (SMC) complexes suggested a role for TFIIIC in SMC regulation and 3D organization of eukaryotic genomes; but the evidence has remained largely correlational due to the essential role of TFIIIC in RNA Pol III transcription. Here, we directly tested the function of TFIIIC in SMC complex regulation by using auxin inducible depletion in C. elegans. We performed Hi-C and ChIP-seq analyses upon acute depletion of TFTC-3, an essential TFIIIC subunit, and RPC-1, the catalytic subunit of RNA Pol III. Our results show that TFIIIC regulates the localization of two different types of SMC complexes, cohesin and condensin. TFIIIC is also required for increased 3D contacts between distant tRNA genes located on the same or different chromosomes. Depletion of individual SMC complexes did not significantly reduce the intrachromosomal tRNA gene contacts, suggesting redundancy or an independent mechanism mediating these contacts. Together, our study demonstrates an RNA Pol III independent function for TFIIIC, regulating binding of both cohesin and condensin, as well as the 3D organization of tRNA genes.
Gulka, A. O. D.; Kang, K. A.; Zhou, Z.; Gorkin, D. U.
Show abstract
BackgroundGene expression changes in response to developmental and environmental cues rely on cis-regulatory sequence elements (cREs). BRG1/BRM-Associated Factors (BAF) chromatin remodeling complexes maintain chromatin accessibility at many cREs, enabling binding by transcription factors (TFs). However, cREs exhibit a broad range of sensitivity to loss of BAF function, and the basis of this variability remains unknown. ResultsTo identify the characteristics of BAF-dependent cREs, we mapped chromatin accessibility changes following acute pharmacologic BAF inhibition in GM12878 lymphoblastoid cells. We integrated these results with over 100 TF and histone modification ChIP-seq datasets and used machine learning to identify features that predict chromatin accessibility changes. We found that Activator Protein 1 (AP-1) factors and lymphoid lineage-defining TFs including RUNX3 and PU.1 predicted BAF-dependence. Strikingly, we found that cREs bearing the chromatin signature of "primed" enhancers - enriched for H3K4me1 but lacking H3K27ac - were significantly more sensitive to BAF inhibition than typical active enhancers. As primed enhancers are known to facilitate transcriptional responses to stimuli, we tested the requirement of BAF activity in these responses. Acute BAF inhibition was sufficient to prevent both chromatin and transcriptional responses to interferon gamma and dexamethasone. cREs which normally gained accessibility in response to stimulation failed to do so with BAF inhibition, and these cREs were linked to genes with suppressed transcriptional induction. ConclusionsCollectively, our results demonstrate a requirement for continuous BAF activity to enable stimulus response and suggest that defective signal responsiveness may be a pathogenic mechanism in disease states caused by loss-of-function mutations in BAF subunits.
Belyakin, S. N.; Maksimov, D. A.; Pobedintseva, M. A.; Laktionov, P. P.; Mikhnevich, N. V.; Sipin, F. A.; Krylova, M. I.
Show abstract
Alleles of ASIP gene (Agouti locus) in dogs determine a wide spectrum of coat colors, from red to black. Gain-of-function Ay allele is the most dominant in the range of known ASIP mutations: when all other genes affecting coat pigmentation are intact, presence of Ay allele results in red coat color. Loss-of-function a allele is the most recessive allele of this gene. When homozygous, it gives black coat color. Usually, dogs with Ay/a genotype have red coat, because a single copy of Ay allele is sufficient to fully compensate for the non-functional allele a, implying the complete dominance in this pair of alleles. However exceptions are known. In the Hungarian Puli breed there is a specific coat pigmentation type called fako. We investigated the genetic composition of fako dogs and found evidence that the dominance of the Ay allele over the a allele may be incomplete in these dogs. Analysis of the MC1R gene that interacts with ASIP in the hair pigmentation genetic cascade allowed us to find the variants that may be responsible for the incomplete dominance of Ay allele over a allele in Hungarian Puli dogs.
Karu, N.; Zhao, H. N.; Batra, R.; Arnold, M.; Krumsiek, J.; David, L. C.; Barupal, D.; Schimmel, L.; Kueider-Paisley, A.; Blach, C.; Borkowski, K.; Dorrestein, P.; Bennett, D. A.; Kaddurah-Daouk, R.; Alzheimer's Disease Metabolomics Consortium,
Show abstract
INTRODUCTIONMounting evidence support exposome influences on brain function and health, complementing genome influences. Understanding the molecular imprint of exposome on brain metabolism and the biochemical communication between the body and brain can impact our fundamental understanding and treatment of neuropsychiatric diseases. METHODSLeveraging two complementary metabolomics platforms, we classified 1400 features in 514 brains from the ROSMAP collection. We evaluated the origin of these compounds using literature and databases. We correlated those metabolites with cognitive function using linear models. RESULTSWe identified over 230 non-endogenous compounds in the brain, including 103 drugs and metabolites, 120 dietary and microbial products and possibly 15 compounds from environmental exposures. Over 20 dietary and gut microbial compounds showed associations with cognition. DISCUSSIONComprehensive profiling of chemicals in the brain and the link to cognitive function provides foundational work to connect body and brain in the study of AD and related dementias.